📄 Document AI - matmat

Discussed on Hacker News

📄OCR LWN.net featured content·

FairScan 2.0 released

📄Document Digitization GitHub·

RewindOS – Searchable screen history for Linux local

Covers 2 stories including Model Context Protocol And OAuth

Discussed on Hacker News

📄Text Mining massgeneralbrigham.org·

New benchmark evaluates AI for everyday patient care

Discussed on Hacker News

🤖Advanced OCR TechCrunch·

Sarvam becomes India’s newest AI unicorn with $234 million funding round led by HCLTech

Covered by 4 sources including easternherald.com, 何夕2077的个人站

Less-relevant results

🧲Magnetic Philosophy clever.cloud·

At VivaTech 2026, Clever Cloud unveils the latest Clever AI updates, the Openvisio beta and its Ultimate Sovereignty Clause

📝Punctuation Engines arxiv.org·

Bounding Box Label Propagation for Re-Annotation of Document Layout Analysis Datasets

🔓Free and open source GitHub·

FairScan: An Android app to scan your documents

Discussed on Hacker News

📄PostScript pqpdf.com·

PDFs Don't Have One Meaning: Measuring Semantic Drift Across 24,824 Files

Discussed on Hacker News

📄Text Mining arxiv.org·

SAMA: Semantic Anchor-aligned Augmentation for Unified Low-Resource Multimodal Information Extraction

📄Text Mining arxiv.org·

Co-Scraper: query-aware DOM Pruning and Reusable Scraper Synthesis for Lightweight Web Data Extraction

📄Text Mining arxiv.org·

Applicability Condition Extraction for Therapeutic Drug-Disease Relations

📄Text Mining arxiv.org·

IUU+DB: Tracking Illegal, Unreported, and Unregulated Fishing, Seafood Fraud, and Labor Abuse through LLM-driven Information Extraction

📊Spectral Analysis arxiv.org·

Multiple cyclicity and Wavelet Decomposition with Channel Correlation for Long-term Time Series Forecasting

📜Manuscript TEI arxiv.org·

Analyzing and Encoding the Al-Mawrid Arabic-English Dictionary with the ISO Language Markup Framework and TEI Lex-0

📄Semantic Chunking arxiv.org·

ChatPlanner: A Large Language Model Framework for Personalized Public Transit Routing

🔗Hypertext Systems arxiv.org·

Semantic Reasoning in Medicine: The Role of Knowledge Graphs Across Five Key Domains

🔨Compilers arxiv.org·

Image Prompt Reconstruction Attacks on Distributed MLLM Inference Frameworks


BCL: Bayesian In-Context Learning Framework for Information Extraction

Hybrid AI Architecture, Part 1: Putting the Right Model in the Right Place

AI Document Integrity — Your AI may read a different PDF than your users
